[Bugfix] [Numpy] Add `kAddTo` and kNullOp to Transpose #16979

sxjscience · 2019-12-05T07:05:27Z

Description

Add kAddTo and kNullOp to np.transpose and fix [Numpy] Transpose does not support kAddTo and kNullOp #16789
Fix the problem that the previous np.tranpose support duplicated axis:

Previously, we can run the following code without any problem:

import mxnet as mx
mx.npx.set_np()
a = mx.np.ones((3, 4, 5))
b = mx.np.transpose(a, axes=(0, 0, 1))
print(b.asnumpy())

Now, we will raise an MXNetError

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at https://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

kAddTo and kNullOp to transpose, test

Check for repeated axes enable addto to transpose fix fix fix fix remove unused ndim Update pseudo2DTranspose_op-inl.cuh Update pseudo2DTranspose_op-inl.cuh Update pseudo2DTranspose_op-inl.cuh fix Update pseudo2DTranspose_op-inl.cuh try to fix Update pseudo2DTranspose_op-inl.cuh Update pseudo2DTranspose_op-inl.cuh Update pseudo2DTranspose_op-inl.cuh fix Update np_matrix_op.cc Update test_numpy_op.py update test case fix implementation fix bug update fix bug Update pseudo2DTranspose_op-inl.cuh fix fix Update test_numpy_op.py

sxjscience · 2019-12-05T07:08:53Z

@leezu @zburning This should fix the XLNet problem.

TaoLv · 2019-12-05T07:15:48Z

src/operator/tensor/matrix_op.cc

  CHECK_EQ(inputs.size(), 1U);
  CHECK_EQ(outputs.size(), 1U);

-  if (SupportMKLDNNTranspose(param, inputs[0])) {
+  if (SupportMKLDNNTranspose(param, inputs[0]) && req[0] == kWriteTo) {


sxjscience · 2019-12-05T08:25:38Z

src/operator/tensor/pseudo2DTranspose_op-inl.cuh

-
-template <typename DType, typename CType>
+/*!
+ * \brief The `transpose_pseudo2D` based on chosen vectorized types. It transpose an array of


@ptrendx I've also added the doc here for the transpose_pseudo2D. Correct me if you find any problem.

src/operator/tensor/pseudo2DTranspose_op-inl.cuh

ptrendx · 2019-12-05T17:11:52Z

src/operator/tensor/pseudo2DTranspose_op-inl.cuh

        #pragma unroll
        for (index_t i = 0; i < TSR; i++) {
+          DType* tmp_dptr = reinterpret_cast<DType*>(&tmp[i]);


Hmmm, I'm not sure if this will still be handled properly by the compiler, let me test that.

Have you find any problem? I can compile that and it also passed CI.

Hi, I only now had a chance to look into it. No, there is not any problem - I was worried that the compiler could get confused by this and put the tmp array in local memory instead of registers, but I tested and it does not do it.

Great! Thanks for clarifying 👍

reminisce

LGTM.